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' This paper considers a random access system where each sender can be in two modes of operation, 

active or not active, and where the set of active users is available to a common receiver only. Active 
transmitters encode data into independent streams of information, a subset of which are decoded by the 
receiver, depending on the value of the collective interference. The main contribution is to present an 
information-theoretic formulation of the problem which allows us to characterize, with a guaranteed gap 
q | to optimality, the rates that can be achieved by different data streams. 

Our results are articulated as follows. First, we exactly characterize the capacity region of a two- 
user system assuming a binary-expansion deterministic channel model. Second, we extend this result 



X 



to a two-user additive white Gaussian noise channel, providing an approximate characterization within 



bit of the actual capacity. Third, we focus on the symmetric scenario in which users are active 
\ with the same probability and subject to the same received power constraint, and study the maximum 

achievable expected sum-rate, or throughput, for any number of users. In this case, for the symmetric 
binary expansion deterministic channel (which is related to the packet collision model used in the 



networking literature), we show that a simple coding scheme which does not employ superposition 
coding achieves the system throughput. This result also shows that the performance of slotted ALOHA 
systems can be improved by allowing encoding rate adaptation at the transmitters, achieving constant 
(rather than zero) throughput as the number of users tends to infinity. For the symmetric additive white 
Gaussian noise channel, we propose a scheme that is within one bit of the system throughput for any 
value of the underlying parameters. 
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I. Introduction 

Random access is one of the most commonly used medium access control schemes for channel sharing 
by a number of transmitters. Despite decades of active research in the field, the theory of random access 
communication is far from complete. What has been notably pointed out by Gallager in his review 
paper more than two decades ago |[T2l is still largely true: on the one hand, information theory provides 
accurate models for the noise and for the interference caused by simultaneous transmissions, but it ignores 
random message arrivals at the transmitters; on the other hand, network oriented studies focus on the 
bursty nature of messages, but do not accurately describe the underlying physical channel model. As 
an example of the first approach, the classic results by Ahlswede {3} and Liao lfT5l provide a complete 
characterization of the set of rates that can be simultaneously achieved communicating over a discrete 
memoryless (DM) multiple access channel (MAC). But the coding scheme they develop assumes a fixed 
number of transmitters with continuous presence of data to send. As an example of the second approach, 
Abramson's classic collision model for the ALOHA network @ assumes that packets are transmitted at 
random times and are encoded at a fixed rate, such that a packet collision occurs whenever two or more 
transmitters are simultaneously active. The gap between these two lines of research is notorious and well 
documented by Ephremides and Hajek in their survey article iPTOl . 

In this paper, we try to bridge the divide between the two approaches described above. We present 
the analysis of a model which is information-theoretic in nature, but which also accounts for the random 
activity of users, as in models arising in the networking literature. We consider a crucial aspect of random 
access, namely that the number of simultaneously transmitting users is unknown to the transmitters 
themselves. This uncertainty can lead to packet collisions, which occur whenever the underlying physical 
channel cannot support the transmission rates of all active users simultaneously. However, our viewpoint 
is that the random level of the interference created by the random set of transmitters can also be exploited 
opportunistically by allowing transmission of different data streams, each of which might be decoded or 
not, depending on the interference level at the receiver. 

To be fair, the idea of transmitting information in layers in random access communication is not new; 
however an information-theoretic perspective of this layering idea was never exposed. Previously, Medard 
et al. ifTTl studied the performance of Gaussian superposition coding in a two-user additive white Gaussian 
noise (AWGN) system, but did not investigate the information-theoretic optimality of such a scheme. In 
the present work, we present coding schemes with guaranteed gaps to the information-theoretic capacity. 
We do so under different channel models, and also extending the treatment to networks with a large 
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number of users. Interestingly, it turns out that in the symmetric case in which all users are subject to 
the same received power constraint and are active with the same probability, superposition coding is not 
needed to achieve up to one bit from the throughput of an AWGN system. 

The paper is organized in incremental steps, the first ones laying the foundation for the more complex 
scenarios. Initially, we consider a two-user random access system, in which each sender can be in two 
modes of operation, active or not active. The set of active users is available to the decoder only, and 
active users encode data into two streams: one high priority stream ensures that part of the transmitted 
information is always received reliably, while one low priority stream opportunistically takes advantage 
of the channel when the other user is not transmitting. Given this set-up, we consider two different 
channel models. First, we consider a binary-expansion deterministic (BD) channel model in which the 
input symbols are bits and the output is the binary sum of a shifted version of the codewords sent by the 
transmitters. This is a first-order approximation of an AWGN channel in which the shift of each input 
sequence corresponds to the amount of path loss experienced by the communication link. In this case, 
we exactly characterize the capacity region and it turns out that senders need to simultaneously transmit 
both streams to achieve capacity. Second, we consider the AWGN channel and present a coding scheme 
that combines time-sharing and Gaussian superposition coding. This turns out to be within \/3/2 bit 
from capacity. Furthermore, we also show that in the symmetric case in which both users are subject to 
the same received power constraint, superposition coding is not needed to achieve up to \/3/2 bit from 
capacity. 

Next, we consider an m-user random access system, in which active transmitters encode data into 
independent streams of information, a subset of which are decoded by a common receiver, depending 
on the value of the collective interference. We cast this communication problem into an equivalent 
information-theoretic network with multiple transmitters and receivers and we focus on the symmetric 
scenario in which users are active with the same probability p, independently of each other, and are 
subject to the same received power constraint, and we study the maximum achievable expected sum-rate 
— videlicet throughput. Given this set-up, we again consider the two channel models described above. 
First, we consider the BD channel model in the symmetric case in which all codewords are shifted by the 
same amount. In this setting, input and output symbols are bits, so that the receiver observes the binary 
sum of the codewords sent by the active transmitters. The possibility of decoding different messages in 
the event of multiple simultaneous transmissions depends on the rate at which the transmitted messages 
were encoded. Colliding codewords are correctly decoded when the sum of the rates at which they were 
encoded does not exceed one. This is a natural generalization of the classic packet collision model widely 
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used in the networking literature, where packets are always encoded at rate one, so that transmissions are 
successful only when there is one active user. We present a simple coding scheme which does not employ 
superposition coding and which achieves the throughput. The coding scheme can be described as follows. 
When p is close to zero, active transmitters ignore the presence of potential interferers and transmit a 
stream of data encoded at rate equal to one. By doing so, decoding at the receiver is successful if there is 
only one active user, and it fails otherwise. This is what happens in the classic slotted ALOHA protocol, 
for which a collision occurs whenever two or more users are simultaneously active in a given slot. In 
contrast, when p is close to one, the communication channel is well approximated by the standard m-user 
binary sum DM-MAC, for which the number of transmitters is fixed and equal to m. In this regime, 
active users transmit a stream of data encoded at rate equal to -j-, that is, each active user requests an 
equal fraction of the m-user binary sum DM-MAC sum-rate capacity. Any further increase in the per-user 
encoding rate would result in a collision. When p is not close to either of the two extreme values, based 
on the total number of users m and the access probability p, transmitters estimate the number of active 
users by solving a set of polynomial equations. If k is the inferred number, then transmitters send one 
stream of data encoded at rate \, that is, each user requests an equal fraction of the fc-user binary sum 
DM-MAC sum-rate capacity. Interestingly, it turns out that the estimator needed to achieve the throughput 
is different from the maximum-likelihood estimator [mp\ for the number of active users. The analysis 
also shows that the performance of slotted ALOHA systems can be improved by allowing encoding rate 
adaptation at the transmitters. In fact, we show that the expected sum-rate of our proposed scheme tends 
to one as m tends to infinity. Hence, there is no loss due to packet collisions in the so called scaling 
limit of large networks. This is in striking contrast with the well known behavior of slotted ALOHA 
systems in which users cannot adjust the encoding rate, for which the expected sum-rate tends to zero as 
m tends to infinity. In practice, however, medium access schemes such as 802.1 lx typically use backoff 
mechanisms to effectively adapt the rates of the different users to the channel state. It is interesting to note 
that while these rate control strategies used in practice are similar to the information-theoretic optimum 
scheme described above for the case of equal received powers, practical receivers typically implement 
suboptimal decoding strategies, such as decoding one user while treating interference as noise. 

Next, we consider the case of the m-user AWGN channel. For this channel, we present a simple coding 
scheme which does not employ superposition coding and which achieves the throughput to within one 
bit — for any value of the underlying parameters. Perhaps not surprisingly, this coding scheme is very 
similar to the one described above for the case of the BD channel. In fact, the close connection between 
these two channel models has recently been exploited to solve capacity problems for AWGN networks 
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through their deterministic model counterpart ||5]. 

Finally, we wish to mention some additional related works. Extensions of ALOHA resorting to prob- 
abilistic models to explain when multiple packets can be decoded in the presence of other simultaneous 
transmissions appear in lTT3l and |[T9l . An information-theoretic model to study layered coding in a two- 
user AWGN-MAC with no channel state information (CSI) available to the transmitters was presented in 
a preliminary incarnation of this work lfl8l . The two-user BD channel has been studied in the adaptive 
capacity framework in fffl and in this paper we also provide a direct comparison with that model. We 
also rely on the broadcast approach which has been pursued in GUI , and E2l to study multiple access 
channels with no CSI available. A survey of the broadcast approach and its application to the analysis 
of multiple antenna systems appeared in ETTl . and we refer the reader to this work and to for an 
overview of the method and for additional references. The DM-MAC with partial CSI was studied in [|8] 
assuming two compressed descriptions of the state are available to the encoders. 

The rest of the paper is organized as follows. The next section formally defines the problem in the case 
of a two-user AWGN random access system. Section [V] presents the extension to of the m-user random 
access system assuming an additive channel model. Section [Vl] consider the case of a BD channel 
model, while section [VTT1 deals with the AWGN channel. A discussion about practical considerations and 
limitations of our model concludes the paper. 

II. The two-user Additive Random Access Channel 

Consider a two-user synchronous additive DM-MAC where each sender can be in two modes of 
operation, active or not active, independently of each other. The set of active users is available to the 
decoder, while encoders only know their own mode of operation. This problem is the compound DM-MAC 
with distributed state information depicted in Fig. [T] Specifically, the state of the channel is determined 
by two statistically independent binary random variables Si and 5*2, which indicate whether user one 
and user two, respectively, are active, and it remains unchanged during the course of a transmission. 
Each sender knows its own state, while the receiver knows all the senders' states. The presence of side 
information allows each transmitter to adapt its coding scheme to its state component. We can assume 
without loss of generality that senders transmit a codeword only when active, otherwise they remain 
silent. 

Each sender transmits several streams of data, which are modeled via independent information mes- 
sages, a subset of which is decoded by the common receiver, depending on the state of the channel. The 
notation we use is as follows. We denote by Wi = {Wi,i, • • • , Wi,|Wi|} anc * ^2 = {W2,i) • • • , W2JW2I} 



December 16, 2009 



DRAFT 



6 




Fig. 1. The two-user MAC with partial CSI modeling random access communications. The variables in bold represent vectors 
of length n. 



the ensemble of independent messages transmitted by user 1 and user 2, respectively. We assume that 
each message Wij is a random variable independent of everything else and uniformly distributed over 
a set with cardinality 2 nRz ' j , for some non-negative rate Rij, j G {1, . . . , |Wj|}, i G {1,2}. We let 
yVi(A) C Wi denotes the set of messages transmitted by user j, i £ {1,2}, that are decoded when the 
set of senders A C {1,2} is active. Finally ri(A) denotes the sum of the rates at which messages in 
Wi(A) are encoded. 

Therefore, we can distinguish three non-trivial cases: if user 1 is the only active user, then the receiver 
decodes the messages in Wi({l}) and the transmission rate is equal to ri({l}); similarly, if user 2 is 
the only active user, then the receiver decodes the messages in W2({2}), which are encoded at total rate 
of r2({2}); finally, the receiver decodes messages in Wi({l,2}) and VV2({1,2}) when both users are 
active, so senders communicate at rate n({l, 2}) and ^({l, 2}), respectively. The resulting information- 
theoretic network is illustrated in Fig. [2l where one auxiliary receiver is introduced for each channel 
state component. It is clear from the figure that, upon transmission, each transmitter is connected to the 
receiver either through a point-to-point link or through an additive DM-MAC, depending on the channel 
state. 

Observe that if the additive noises in Fig. [2] have the same marginal distribution, then the channel 
output sequence observed by the MAC receiver is a degraded version of the sequence observed by each 
of the two point-to-point receivers, because of the mutual interference between the transmitted codewords. 
As in a degraded broadcast channel, the "better" receiver can always decode the message intended for 
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Fig. 2. The network used to model a two-user random access system. The subscript index in Y and in Z denote the set 
of active users, Wi({l,2}) and Wi({l,2}) encode information which is always decoded, while Wi({l}) \ Wi({l,2}) and 
W2({2}) \ W2({1, 2}) denote messages which are decoded when there is no interference. 

the "worse" receiver, similarly here each point-to-point receiver can decode what can be decoded at the 
MAC receiver. Thus, there is no loss of generality in assuming that 

Wi({l,2})CWi({l}) (1) 

and that 

W 2 ({1,2}) C W 2 ({2}). (2) 

Then, messages in Wi({l,2}) and Wi({l,2}) ensure that some transmitted information is always 
received reliably, while the remaining messages provide additional information that can be opportunis- 
tically decoded when there is no interference. If conditions £T|) and © are satisfied, then we say that 
W = ({Wi,W 2 },{Wi({l}), Wi({l,2}), W 2 ({2}), W 2 ({1,2})}) denotes a message structure for the 
channel in Fig. [2] 

For a given message structure W, we say that the rate tuple (ri({l}), r 2 ({2}), r\({l, 2}), r 2 ({1, 2})) 
is achievable if there exist a sequence of coding and decoding functions such that each receiver in Fig. |2] 
can decode all intended messages with arbitrarily small error probability as the coding block size tends 
to infinity. We define the capacity region Sf^ as tne closure of the set of achievable rate tuples. 

Observe that as we vary |Wi|, |W 2 |, and the sets of decoded messages, there are infinitely many 
possible message structures for a given channel. For each one of them we define Sow- 

Next, we define the capacity of the channel in Fig. [2l denoted by So, as the closure of the union of 
Sb°w over all possible message structures W . Note that So represents the optimal tradeoff among the rates 
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(ri({l}), r2({2}), ri({l, 2}), ^({1, 2}) over all possible ways of partitioning information into different 
information messages such that conditions (Q~|) and (0 are satisfied. 

In the next section we answer the question of characterizing 'tf for two additive channels of practical 
interest. First, we consider the BD channel model, for which we completely characterize the capacity 
region c €. Perhaps not surprisingly, we show that to achieve ^ it suffices that each sender transmits 
two independent information messages, one of which carries some reliable information which is always 
decoded, while the remaining one carries additional information which is decoded when the other user 
is not transmitting. Second, we consider the AWGN channel, for which we provide a constant gap 
characterization of ^, where the constant is universal and independent of the channel parameters. Finally, 
we apply this result to the study of the throughput of a two-user random access system under symmetry 
assumptions. 



Suppose that channel input and output alphabets are each the set {0, l} ni , for some integer number 
m, and that at each time unit t G {1, . . . , n} inputs and outputs are related as follows: 



where n 2 < n\ denotes an integer number, summation and product are over GF(2), and S ni ~ n ' 2 denotes 
the (ni — ri2) x (m — n 2 ) shift matrix having the (i,j)th component equal to 1 if i = j + (ni — 712), 
and otherwise. By pre-multiplying X 2f t by S ni ~ n2 , the first 77,2 components of X 2 ,t are down-shifted 
by (rai — 77.2) positions and the remaining elements are set equal to zero. We refer to this model as 
the two-user BD random access channel (RAC), see Fig. [3] for a pictorial representation. Physically, 
this channel represents a first-order approximation of a wireless channel in which continuous signals are 
represented by their binary expansion, the codeword length n\ represents the noise cut-off value, and 
the amount of shift n\ — n 2 corresponds to the path loss of user 2 relative to use 1 (51 . The following 
theorem characterizes the capacity region of this channel. 

Theorem 1: The capacity region ^of the two-user BD-RAC is the set of non-negative rate tuples such 



III. Example 1: the two-user BD random access channel 



s n i- n *x 2tt , 




(3) 
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that 

ritfl}) < m, 

r 2 ({2}) < n 2 , 
ri({l})+r 2 ({l,2}) < m, 
r 2 ({2})+n({l,2}) < m, 

n({i,2}) < n({i}), 

r 2 ({l,2}) < r 2 ({2}). 

The proof of the converse part of the above theorem can be sketched as follows. Observe that the 
common receiver observing Yi 2 = {5 / i 2 ,i, • • • , ^12,71} can decode messages in W 2 ({1, 2}). Let us suppose 
that this receiver is given messages in W 2 \ W 2 ({1, 2}) as side information. Then, it has full knowledge 
of W 2 , so it can compute the codeword X 2 transmitted by user 2, subtract it from the aggregate received 
signal Y12, obtaining Xi. Thus, given the side information, the channel output observed by the common 
receiver becomes statistically equivalent to Yi. Since receiver 1 can decode Wi({l}) upon observing 
Yi, we conclude that receiver 12 must also be able to decode message W 2 ({1,2}). Hence, ri({l}) + 
r 2 ({l, 2}) < n\. By providing side information about message Wi \ Wi({l, 2}) and following the same 
argument above, we obtain that r 2 ({2}) + ri({l, 2}) < n\. The remaining bounds are trivial. 

The proof of the achievability part of the theorem shows that it suffices to partition information into two 
independent messages, such that Wi = {Wi^, Wi, 2 } and W 2 = {W^ii W2,2}- Messages W± t 2 and M^,2 
represent ensure that part of the transmitted information is always received reliably, while W\ : i and PF 2 ,i 
are decoded opportunistically when one user is not transmitting. The corresponding message structure is 
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illustrated in Fig. [3] In general, the coding scheme which we employ in the proof of the achievability 
requires that user 1 simultaneously transmits W\ t i and W\$. However, in the special symmetric case in 
which n\ = ra 2 all rate tuples in the capacity region can be achieved by means of coding strategies in 
which each user transmits only one of the two messages. 

Proof: First, we prove the converse part of the theorem. The first two inequalities which define ^ are 
standard point-to-point bounds which can be derived via standard techniques. To obtain the third inequal- 
ity, observe that by Fano's inequality we have that H(Wi({l, 2})|Yi 2 ) < ne n , H(W 2 ({1, 2})| Y 12 ) < 
ne n , as well as H(Wi(i)\Yi) < ne n , where e n tend to zero as the block length n tends to infinity. From 
the independence of the source messages, we have that 

n(ri({l})+r 2 ({l,2}))= J ff(W 1 ({l}),W 2 ({l,2})), 

= #(Wi({l}), W 2 ({1, 2})|W 2 \ W 2 ({1, 2})), 
= /(Wi({l}), W 2 ({1,2}); Y 12 |W 2 \ W 2 ({1,2})), 

+ H(Wi({l}), W 2 ({1, 2})| Yi 2 , W 2 \ W 2 ({1, 2})). (5) 

Using the memoryless property of the channel and the fact that conditioning reduces the entropy, the 
first term in the right hand side of ((5]) can be upper bounded as 

I(Wi({l}),W 2 ({l,2});Y 12 |W 2 \W 2 ({l,2})) <nm. (6) 

On the other hand, from the chain rule, the fact that conditioning reduces the entropy, and Fano's 
inequality, we have that 

W 2 ({1, 2})|Yi2, m \ W 2 ({1, 2})) 
= H(m({h 2})|Yi 2 , W 2 \ W 2 ({1, 2})) + ff(Wi({l})|Yia, W 2 ) 

< H(m({h 2})|Y 12 ) + F(Wi({l})|Yi 2) W 2 , X 2 ) 
= if(W 2 ({l,2})|Y 12 ) + £T(V^({l})|Yi) 

< 2ne n (7) 

where the last equality is obtained observing from ((3]) that, if X 2 is given, then Yi 2 is statistically 
equivalent to Yi. Substituting © and (|7]) into ([5]), we have that 

n(n({l})+r 2 ({l,2})) <nn 1 + 2ne n , 
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and the desired inequality is obtained in the limit of n going to infinity. The fourth inequality in (0]) is 
obtained by a similar argument. Finally, the last two inequalities in (0]) follow from (OQ) and (0. 

Next, to prove the direct part of the theorem, we establish that *rf is equal to the capacity of the 
two-user BD-RAC for the specific message structure defined by Wj = {Wi i, Wi^}, VVi({i}) = Wj, and 
Wj({12}) = {Wj^}, i 6 {1, 2}. For this message structure we have that 



We have established above that if (ri({l}), r 2 ({2}), n({l,2}), r 2 ({l,2}) £ C c i, , then inequalities 
dU) have to be satisfied. Combining the non-negativity of the rates, (0]), and ([8]), and eliminating (ri({l}), 
r 2 ({2}), n({l, 2}), r2({l,2}) from the resulting system of inequalities, we obtain 



The above system of inequalities is the image of (0]) under the linear map ([8]). Since the map is invertible, 
proving the achievability of all rate tuples (ri({l}), r2({2}), ri({l,2}), r 2 ({l,2}) satisfying (0]) is 
equivalent to proving the achievability of all rate tuples i, -^2,1, -Ri,2) ^2,2) satisfying (O. It is tedious 
but simple to verify that the set of non-negative rate tuples satisfying (© is equal to the convex hull of ten 
extreme points, four of which are dominated by one of the remaining six. Given two vectors u and v, we 
say that u dominates v if each coordinate of u is greater than or equal to the corresponding coordinate of 
v. The six dominant extreme points of (© are given by vi = [n,2, ra 2 , «i— n 2 , 0] T , v 2 = [n\— 112, 0, 0, ri2] T , 
v 3 = [0,0, m - n 2 ,ni] T , v 4 = [m, n 2 , 0, 0] T , v 5 = [0, 0, m, 0] T , v 6 = [0,0,0, n 2 ] T , where the four 
coordinates denote (i?i,i, -^2,1, -^1,2, ^2,2), respectively. 

The achievability of vi, V6 can be sketched as follows. To achieve vi sender 1 transmit simulta- 
neously W\ t 2 and W\ t i, in the first m — n 2 and last n 2 components of X\, respectively. User 2, instead, 
transmits W21 in the first n 2 components of X 2 . Because of the downshift in X 2 , the multiple access 
decoder receives the binary sum of W\,\ and W2,i in the last n 2 components of Y\2, and can successfully 
decoded W\$ from the first n\ — n 2 interference-free components. Effectively, coding is performed so that 



ri({l}) 
r 2 ({2}) 

n({l,2}) 

r 2 ({l,2}) 



#1,2 + #1,1; 
#2,2 + #2,1 > 
#1,2, 
#2,2- 



(8) 



#1,1 + #1,2 < ni, 

#2,1 + #2,2 < "2, 

#1,1 + #1,2 + #2,2 < ni, 

#2,1 + #1,2 + #2,2 < n l- 



(9) 
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Fig. 4. Pictorial representation of the coding scheme which achieves the rate tuple Rij = ri2, Ri,2 = n\ — 712, J?2,i = fi2, 
i?i ; 2 = 0. Message is transmitted via the first n\ — TI2 interference-free links (dotted lines), while and Wi,\ are sent 
through the remaining 712 links (solid lines), so that the interference they generate results "aligned" at the common receiver. 



W\ \ and Wi,\ are received "aligned" at the common receiver, see Fig. [4] for a pictorial representation. 
Observe that in the special case in which n\ = n^, sender 1 only transmit message W\^- Likewise, 
V2 , . . . , V6 can be achieved by transmitting one message per user, in such a way that the transmitted 
codewords do not interfere with each other at the multiple access receiver. For example, to achieve Vi 
user 2 transmits W22 in the first n,2 components of X2, while user 1 transmits W12 in the first n\ — ri2 
components of X\. 

Next, observe that if an extreme point v is achievable, then all extreme points dominated by v are 
also achievable by simply decreasing the rate of some of the messages. Finally, any point in dU) can be 
written as convex combination of the extreme points, hence it can be achieved by time-sharing among 
the basic coding strategies which achieve vi, . . . , Ve. This shows that all rate tuples satisfying (O are 
achievable. ■ 

A. The throughput in a symmetric scenario. 

Having an exact characterization of the capacity region at hands, it is now possible to formulate and 
solve optimization problems of practical interests. As an example, we consider the problem of maximizing 
the throughput in the symmetric scenario where n\ = 712 = 1, and where each user is independently 
active with probability p. 

This model represents a first-order approximation of a wireless channel in which data arrivals follow 
the same law, and where transmitted signals are received at the same power level. The codeword length is 
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normalized to 1 so that the maximum amount of information which can be conveyed across the channel 
is one bit per channel use, regardless the number of active users. The possibility of decoding different 
messages in the event of multiple simultaneous transmissions depends on the rate at which the messages 
were encoded. Colliding codewords are correctly decoded when the sum of the rates at which they were 
encoded does not exceed one. This is a natural generalization of the classic packet collision model widely 
used in the networking literature, where packets are always encoded at rate one, so that transmissions 
are successful only when there is one active user. The parameter p represents the burstiness of data 
arrivals, and determines the law of the variables S\ and S2 in Fig. [1] hence the channel law. Based on 
the knowledge of p, each sender can "guess" the state of operation of the other user, and optimize the 
choice of the encoding rates so that the expected sum-rate, or throughput, is maximized. 
Formally, we look for the solution of the following optimization problem: 



subject to the constraint that the rates should be in If. Observe that the weight assigned to each rate 
component ri(A) is uniquely determined by p, and is equal to the probability that users in the set A are 
active. By means of Theorem [T] it is easy to show that the solution to the above problem is equal to 



The coding strategy used to achieve the throughput can be described as follows. If the transmission 
probability p lies in the interval (0, 1/2], then user i transmits message W^i encoded at rate 1. A collision 
occurs in the event that both senders are simultaneously transmitting, which occurs with probability p 2 , 
in which case the common receiver cannot decode the transmitted codewords. Decoding is successful 
if only one of the two users is active, so the expected sum-rate achieved by this scheme is equal to 
2p(l — p). If, instead, the transmission probability p lies in the interval (1/2, 1], then user i transmits 
message W^2 encoded at rate 1/2, i.e., at half the sum-rate capacity of the two-user binary additive 
MAC. By doing so, the transmitted codewords are never affected by collisions, and can be decoded in 
any channel state. This yields an expected sum-rate of 2p{l — p)l/2 + p 2 . It should be highlighted that 
in this symmetric scenario each user transmits only one of the two messages for any value of p. 

We show later in the paper that this optimization problem can be solved in the general case of a 
network with more than two users. 



maxp(l - p) [n({l» + r 2 ({2})] + p 2 [n({l, 2}) + r 2 ({l, 2})] 




2p(l-p), if p€ (0,1/2]; 
p, if pe (1/2,1]. 
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IV. Example 2: the two-user AWGN-RAC 

We now turn to another example of additive channels. Assume that at each discrete time step inputs 
and outputs are related as follows: 

Y\,t = Xij + Zij, 

Y 12 ,t = X ljt + X 2 , t + Z 12jt , (10) 
Y 2> t = X 2) t + Z 2i t, 

where Z\^, Z%$, and Z\ 2j t are independent standard Gaussian random variables, and the sum is over 
the field of real numbers. Assume that the realizations of {JQ,t} satisfy the following average power 
constraint 

n 

t=i 

for some positive constant Pj, i = 1, 2, and that Pi > P2. We refer to the model in dTQb as the two-user 
AWGN-RAC. In the rest of the paper, we use the notation C(x) = l/21og(l + x). 

An outer bound to the capacity region ^ of the two-user AWGN-RAC in (TTOl is given by the following 
Theorem. 

Theorem 2: Let ^€ denote the set of non-negative rates such that 



n({i}) 


< 


C(Pi), 


r 2 ({2}) 


< 


C(P 2 ), 


ri({l})+r 2 ({l,2}) 


< 


C(Pi+P 2 


r 2 ({2})+n({l,2}) 


< 


C(Pi+P 2 


1({1,2}) 


< 


n({i}), 


r 2 ({l,2}) 


< 


r 2 ({2}). 



Then, ^C?. 

The proof of the above theorem is similar to the converse part of Theorem Q] and it is hence omitted. 

Next, we prove an achievability result by computing an inner bound to the capacity region of the 
two-user AWGN-RAC for a specific message structure "W . As for the BD-RAC, we let Wj = {Wi,i, W^ 2 }, 
Wi(i) = Wi, and Wj(12) = {VFi )2 }, i G {1,2}. The encoding scheme we use is Gaussian superposition 
coding. Each sender encodes the messages using independent Gaussian codewords having sum-power 
less or equal to the power constraint. Decoding is performed using successive interference cancelation: 
messages are decoded in a prescribed decoding order, treating interference of messages which follow in 
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the order as noise. Then, each decoded codeword is subtracted from the aggregate received signal. 
Proposition 3: Let c €^ / denote the set of non-negative rates such that 

ri({l}) < C(Pi), 
r 2 ({2}) < C(P 2 ), 
n({l})+r 2 ({2}) < C(Px+P 2 ), (12) 

n({l,2}) < n({i}), 

r 2 ({l,2}) =r 2 ({2}). 

Similarly, let ^f'^ denote the set of non-negative rates satisfying (fT2)) after after swapping the indices 1 
and 2. Finally, let denote the set of non-negative rates satisfying the following inequalities 

n({i.2})<c( ft as +1 ), 

^ 2 })<c( a +1 ), 

n({l, 2}) + r 2 ({l, 2}) < C ( (1 t^7a )Pa ) , d3) 

ri({l}) < n ({1,2}) +0(^0, 
r 2 ({2}) < r 2 ({l,2})+C(/3 2 P 2 ). 

for some (ft, fa) £ [0, 1] x [0, 1]. Let ^ r = closure^ U tfjy U ^). Then, f> C ^ C K 

Proof: Suppose that sender two does not transmit message W21, i.e., i? 2 ,i = 0. The achievability 
of j£ 2 can tnen b e shown by using a standard random coding argument as for the AWGN-MAC. To 
send (Wi : 2,W± : i), encoder one sends the sum of two independent Gaussian codewords having sum- 
power equal to Pi. On the other hand, sender two encodes W%-2. into a Gaussian codeword having power 
P 2 . A key observation is that the common receiver observing Y"i 2 can decode all transmitted messages: 
Wi,2> ^2,2 can be decoded by assumption, while W\ \ can be decoded after having subtracted X 2 from 
the received channel output. Thus, by joint typical decoding, decoding is successful with arbitrarily 
small error probability if R 1A + R h2 + i? 2 , 2 < C(Pi + P 2 ), i.e., ri({l}) + r 2 ({2}) < C(P 1 + P 2 ). 
Similarly, the receiver observing Y\ can decode messages W± : 2, W\ \ as long as + i?i j2 < C(Pi), 
i.e., ri({l}) < C(Pi) while the receiver observing Y" 2 can decode messages M^ 2j2 if r 2 ({2}) < C(P 2 ). 
We conclude that ^ 2 * s an inner bound to the capacity region. By swapping the role of user 1 and user 2 
it is easy to see that ^f 2 ' * s a l so an inner bound to the capacity region. We claim that ^f 2 " can be achieved 
by a coding scheme which combines Gaussian superposition coding and multiple access decoding. As 
in the Gaussian broadcast channel, to send the message pair (W^i, Wj^), encoder i sends the codeword 
[W^i, Wi ;2 ) = Uj(Wj j2 ) +Vj(Wj ) i), where the sequences Uj and Vj are independent Gaussian 
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codewords having power (1 — /%)Pj and /3jPj respectively, i = 1,2. Upon receiving Yi 2 , decoder 12 first 
decodes Wi l2 and W2,2 using a MAC decoder and treating Vi(Wi,i) + V 2 (W 2j i) as noise. Decoding 
is successful with arbitrarily small error probability if 

r> ^ n ( (l-ffi)Pi ^ 
^2,2 < C ^ iPi+A p 2+1 j , 

Upon receiving Yj = U^Wj^) + V(Wj j i) + Zj, decoder i performs decoding via successive interference 
cancelation: it first decodes Wj j2 treating Vj(Wi,i) +Zj as noise, then it subtracts Uj(Wj j2 ) from and 
decodes W^i from V^W^i) + Zj. Thus, decoding of Wi i2 is successful if i2j )2 < C ( fel^r ) , while 
decoding of Wj ( i is successful if R^x < C(/?iP). After combining these conditions to the equalities 
which relate (ri({l», r 2 ({2}), ri({l,2}), r 2 ({l,2}) to (R lfl , R 2>1 , R 1>2 , R 2>2 ), and eliminating (R ltl , 
R-2,1, Ri,2, ^2,2) from the resulting system of inequalities, we obtain that (fl4l ) have to be satisfied for 
the above coding scheme to work. Finally, a standard time-sharing argument can be used to show that 
the closure^ U U C <<f 2 ■ 
The following theorem explicitly characterizes the gap between the above inner and outer bounds on c €. 

Theorem 4: Let Then, there exists R' e such that || R - R' ||< 

Proof: See Appendix I. ■ 

Observe Gaussian superposition coding is not the optimal coding strategy for the AWGN channel 
under consideration. However, the above theorem ensures that Gaussian superposition coding achieves 
to within \/3/2 bit from the capacity < €. It is important to note that this bound holds independently of 
the power constraints Pi and P 2 . The proof of the above theorem is established by showing that for any 
extreme point v of < ^ 7 2 , there exists an r G c £_ 2 at distance less that y/S/2 from v. Since any point R in 
^2 is a convex combination of extreme points of ^ 2 , we can employ a time-sharing protocol among the 
various achievable rate points {r} and achieve a rate point at distance less that y/3/2 from R. 

A. An approximate expression for the throughput. 

As an application of the above result, consider the symmetric scenario where Pi = P 2 = P, and where 
each user is active with probability p. Based on the knowledge of p, transmitters optimize the choice 
of the encoding rates so that the throughput is maximized. Formally, we look for the solution of the 
following optimization problem: 

maxp(l ~ P) [ri({l» + r 2 ({2})] + p 2 [n({l, 2}) + r 2 ({l, 2})] 
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subject to the constraint that the rates should be in c £. Combining Theorem [2] and Theorem[3] it is possible 
to show that the above maximum is equal T(p, P) + e(p, P), where 

m { 2p(l-p)C(P), ifp€(0, Pl (P)]; 

pC(2P), ifpG (pi(P),l], 

Pl (P) = 1 - C(2P)/(2C(P)) G (0, 1/2], and < e(p,P) < 1. Observe that the bound on the error term 
holds for any choice of the parameters p and P. 

The coding strategy used to achieve T(p, P) is similar to the one described for the case of the 
symmetric BD channel. If the transmission probability p lies in the interval (0,pi(P)], then user i transmits 
message W^i encoded at the maximum point-to-point coding rate, i.e., C(P). If, instead, the transmission 
probability p lies in the interval (pi(P),l], then each active user transmits message Wip encoded at 
rate 1/2C(2P), i.e., at half the sum-rate capacity of the two-user AWGN-MAC. The parameter pi(P) 
represents a threshold value below which it is worth taking the risk of incurring in a packet collision. 
Observe that pi(P) — > 1/2 as P — > cxd. 

Fig- 13 compares T(p, P) to the expected sum-rate achieved under the adaptive-rate framework lfl4l . and 
to its counterpart assuming that full CSI is available to the transmitters. In the adaptive-rate framework, 
each sender transmits at a rate of C(2P)/2, so that users can always be decoded. The figure illustrates 
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how our approach allows us to improve upon the expected adaptive sum-rate for small values of p, for 
which the collision probability is small. In this regime, our inner bound is in fact close to the curve 
obtained giving full CSI to the transmitters. Later in the paper, we shall see that the gain provided by 
our approach becomes more significant when the population size of the network increases. 



In this section we extend the analysis previously developed for a two-user system to the case of an 
m-user MAC, where m denotes an integer > 2, and in which each transmitter can be in two modes of 
operation, active or not active. The set of active users, denoted in the sequel by A, determines the state 
of the channel. That is, the channel is said to be in state A if all users in the set A are active. As in 
the two-user case, transmitters only know their own state component, and encode data into independent 
streams of information. The common receiver knows the set of active users, and decodes subsets of the 
transmitted data streams depending on the state of the channel. 

By introducing one auxiliary receiver per each channel state, we can map this problem to a broadcast 
network with m transmitters and 2 m — 1 receivers. A one-to-one correspondence exists between the set 
of receivers and the set of non-empty subsets of {1, . . . , m}, so that for each set of active users A, there 
exists a unique corresponding receiver, which with abuse of notation we refer to as receiver A. Receiver 
A observes the sum of the codewords transmitted by users in A plus noise, and decodes a subset of 
the data streams sent by the active users. Observe that for a given channel state, only one among these 
auxiliary broadcast receivers corresponds to the actual physical receiver. 

The formal description of the problem is as follows. 

A. Problem formulation 

Definition 5.1: An m-user DM-RAC ({Xi, . . . , X m }, {y>A ■ A £ {1, • • • , m}}, (p({yA ■ A- £ 
{1, • • • , m}}|xi, . . . , x m )) consists of m input sets X±, . . . , X m , 2 m — 1 output sets {3^4.}> and a collection 
of conditional probabilities on the output sets. 

The channel is additive if at any discrete unit of time t 6 {1, . . . , n}, the input symbols {X\±, . . . , X m j) 
are mapped into 2 m — 1 channel output symbols {Ya,*} via the additive map 



where the {Zaj} are mutually independent random variables with values in a set Z, and the sum is over 
a field F such that there exists m embeddings Fi : Xi — > F, and one embedding F m+ i : Z — > F. In the 



V. The TO-USER additive RAC 




(15) 
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next section we consider two classes of additive random access channels: the symmetric BD-RAC, for 
which the channel inputs are strings of bits, and the sum is binary; and the symmetric AWGN-RAC, for 
which X = Z = E, the channel inputs are subject to an average power constraint, and the sum is over 
the reals. 

Definition 5.2: A message structure W = ({Wi, . . . , W m ,}, {Wi{A) : i £ A C {1, • • • , m}}) for an 
m-user RAC consists of m input message sets W«, Wi = {W^x, • • • , Wj jyy.i}, and m2 m_1 output sets 
yVi(A), Wi(A) C Wi, such that the following condition is satisfied: 
Al. Wi(B) C Wi(A) for all i G A C B C {1, . . . ,m}. 

For each i and j £ {1, . . . , |Wj|}, message Wij is a random variable independent of everything else and 
uniformly distributed over a set with cardinality 2 nRi <' , for some non-negative rate Rij, j £ {1, . . . , |Wj|}. 

The reason for imposing condition Al. is as follows. Observe from (TT5T > that if A C i? and the marginal 
distributions of the noises Z# and are equal, then Ys is a (stochastically) degraded version of Ya- 
Then, condition Al. says that the "better" receiver A must decode what can be decoded at the "worse" 
receiver B. 

For a given message structure W , let 

n{A)= R ^ W 

denote the sum of the rates of the messages in Wi{A). Observe that (fT6l ) defines a linear mapping from 
jj|VVi|x...x|W m | - nto gm2 m 1 shows how a macroscopic quantity, the rate at which user % communicates 
to receiver A, is related to various microscopic quantities, the coding rates of the individual transmitted 
messages. 

Definition 5.3: An n-code for the RAC {{X\, . . . , X m }, {^A : A £ {1, ••• ,m}}, (p({va ■ A ^ 
{1, • • • , m}}|xi, . . . , x m )) and for the message structure W consists of m encoding functions (encoders) 
and 2 m — 1 decoding functions (decoders). Encoder i maps each {W^i,-" , Wjiyy.i} into a random 
codeword Xj = X^2, ■ ■ ■ , ^i,n} of n random variables with values in the set Xi. Decoder A maps 

each channel output sequence £ y\ into a set of indexes Uj.^. £WJA){Wi,j}> where each index 
Wij £ {1, ... , 2 2nRi °i} is an estimate of the corresponding transmitted message Wij £ Wi(A). 

Definition 5.4: For a given n-code, the average probability of decoding error at the decoder A is 
defined as 

Pr{\Vij + Wij : Wij £ Wi(A), j £{!,..., \Wi(A)\},i £ a} . (17) 
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Definition 5.5: A rate tuple {ri(A)} is said to be achievable if there exists a sequence of n codes such 
that the average probability of a decoding error (fT71 ) for each decoder vanishes to zero as the block size 
n tends to infinity. 

Definition 5.6: The capacity region of the m-user RAC ({Xi, . . . , X m }, {y*A '■ A £1 {1, • • • , to}}, 
(p({ua - A £1 {1, • • • , m}}|xi, . . . , x m )) for the message structure W is closure of the set of achievable 
rate vectors {r.i{A)}. 
Finally, 

Definition 5.7: The capacity region ^ of the m-user RAC ■ ■ ■ , X m }, {y A : A $1 {1, • • • , m}}, 

(p({yA ■ A <^ {!,-■■ ,m}}\xx,...,x m )) is defined as 

'W= closure(U#/^V). 



B. An outer bound to the capacity ^ 

Theorem 5: The capacity region ^ of the additive m-user additive RAC in ( fT5l ) is contained inside 
the set of non-negative rate tuples satisfying 

n{B) < n(A) for alii G B C A, (18) 

and 

2 ({i x . . . i k }) < I(X h ,...,X iK ; Y ilmiK ), (19) 

k=l 

for all K G {1, ... , m} and «i / . . . / i m G {1, . . . , m}, and some joint distribution p(q)p(xi\q) ■ ■ -p(x m |(?), 
where \Q\ < ml x m. 

Proof: See Appendix II. ■ 
Remark 1: In the special case of a network with two users, it is immediate to verify that the outer 

bound given by the above theorem reduces to the region given by Theorem Q] and Theorem [2] for the 

two-user BD-RAC and the two-user AWGN-RAC, respectively. 

Remark 2: An inspection of the proof of the above theorem shows that the additive channel model 

assumed in the theorem can be replaced with a more general family of maps, namely with those channels 

with the property that, if Xyi' is given, then is statistically equivalent to Y A \ A ,, A' C A. 
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Remark 3: Observe that ( fl9l ) gives m constraints for any permutation of the set {1, . . . ,m}, so it 
defines m x ml inequalities. 

Equation ( fT9l ) can be obtained as follows. Suppose that we fix a set of active users 
for some K G {1,... ,m}, and we provide the receiver observing Y^...^ with messages in the set 
U^ =1 Wi K _ r+1 \ yVi K _ r+1 ({i\ . . .iK-r+i}) as side information. Suppose that this receiver decodes one 
user at the time, starting with user %k and progressing down to user i\. Let us consider the first 
decoding step. By assumption, receiver {ii-.-ix} can decode information in Wi K . . . ix}) so, 
given the side information Wi K \ Wi K . . - ir}) it has full knowledge of Wi K , it can compute the 
codeword Xj K transmitted by user ir and subtract it from the aggregate received signal, obtaining 
Yi 1 ___i K — X-i K = Yj 1 ...j K _ 1 . Thus, at the end of the first decoding step the channel output observed by 
receiver {i\ . . . Ir\ is statistically equivalent to Yj 1 ...j Jf _ 1 . It follows that at the next decoding step it can 
decode information in Wi K _ 1 . . . ir-i})- By proceeding this way, at the rth iteration we obtain a se- 
quence which is statistically equivalent to Yj 1 ...j K r+1 . Hence, receiver {ii . . . ix} can decode information 
in Wi K _ r+1 . . . ix-r+i}), then make use of the side information WK-r+i\Wi K _ r+1 ({ii ■ ■ ■ ix-r+i}) 
to compute Xj K _ r+1 and subtract it from the aggregate received signal before turning to decoding the next 
user. In other words, at the rth step of the iteration user ift-_ r+ i's signal is only subject to interference 
from users h,... ,ik-r> as the signal of the remaining users has already been canceled. Therefore, user 
ik-r+i communicates to the receiver at a rate equal to rj fe _ r+1 ({«i . . . ik~ r +i})- 

In summary, equation ( fT9l ) says that the sum of the communication rates across the K iterations cannot 
exceed the mutual information between the channel inputs on the transmitters side and the channel output 
on the receiver side, regardless of the permutation on the set of users originally chosen. 

C. The throughput of a RAC 

Assume that each user is active with probability p, independently of other users, and that p is available 
to the encoders. In light of these assumptions, 

Definition 5.8: The maximum expected sum-rate, or throughput, of a RAC is defined as 

T(p, m) = max 

AC{l,...,m} i&A 

where the maximization is subject to the constraint that the rates should be in the capacity region ^ of 
that channel. 
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The fact that each user is active with the same probability p has one important consequence. By 
re-writing the objective function in (l20l ) as 

m 

fc=l AC{l,...,m} ieA 

|A|=fc 

and defining 

^ = X! Z r i(^)' fc G {1, . . . , m}, (21) 

AC{1 m} «eA 

|A|=fc 

it is clear that the objective function in d20l ) depends only on pi, ... , p m . It follows that in order to 
compute T(p, m) it is sufficient to characterize the optimal tradeoff among these m variables. This 
motivates the following definition 

Definition 5.9: Let ^ p denote the image of the capacity < ?f of an m-user additive RAC under the linear 
transformation given by (l2Tb . 

It should be emphasized that the symmetry of the problem allow us to greatly reduce the complexity of 
the problem: instead of characterizing "jf, which is a convex subset of Rip 2 " \ it suffices to study the 
set ffp, which is a convex subset of W£. Thus, we have that 

m 

T(p, m) = max W(l- p) m ~ V (22) 

In the sequel, outer and inner bounds on are denoted by % p and ^ p respectively. In what follows, 
we denote by 

the probability of getting exactly k successes in m independent trials with success probability p, and we 
denote by 

k 

F m ,k(p) = 22fm,i(p) 

the probability of getting at most k successes. 

VI. Example 1: the tti-user symmetric BD-RAC 

In this section, we consider the m-user generalization of the symmetric BD-RAC considered in 
Section JIIJ where all transmitted codewords are shifted by the same amount. This model represents 
an approximation of a wireless channel in which signals are received at the same power level. 
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Suppose the X and y alphabets are each the set {0, 1}, the additive channel (TT5T ) is noise-free, so 
Za = 0, and the sum is over GF(2). Observe that this is the ?n-user version of the channel model in (O in 
the special case where n\ = . . . = n m = 1. The codeword length is normalized to 1. As mentioned above, 
this channel model can be thought of as a natural generalization of the packet collision model widely 
used in the networking literature, where packets are always encoded at rate one, so that transmissions 
are successful only when there is one active user. Theorem [5] yields the following proposition. 

Proposition 6: The capacity region of the ?n-user symmetric BD-RAC is contained inside the set 
of {ri(A)} tuples satisfying 

n{B) < ri {A) for all i G B C A, (23) 

and 

m 

5> fc ({ti...i*})<l, (24) 

k=l 

for all i\ 7^ . . . ^ i m G {1, . . . , m}. 



A. The throughput of the symmetric BD-RAC 

Next, we turn to the problem of characterizing the throughput T(p, m) for the symmetric BD-RAC. 
The following theorem provides the exact characterization of for this channel. 

Theorem 7: for the m-user symmetric BD-RAC is equal to the (pi, . . . ,p m ) tuples satisfying 

Pk pk+l _Pjn 

— n , -i \ / m \ — — m\ — ' \ ' 



and 



£ yk - L (25b) 

fc=l K \k) 



Proof: See Appendix III. ■ 
We outline the proof of the theorem as follows. The outer bound in the above theorem makes use 
of Proposition [6] To prove the achievability, we show that ^ p is equal to the image under the linear 
transformation given by (|2TI ) of the capacity region c toy^ of the m-user symmetric BD-RAC for the 
message structure W defined by 

Wi = {W i , 1 ,...,W i , m }, ie {!,..., m} (26) 
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and 

Wi(A) = Uj>\A\Wij, (27) 

for i G A C {1, . . . , m}. This message structure is the natural generalization of the message structure used 
for the two-user BD-RAC. Each sender transmits m independent messages, which are ordered according 
to the amount of interference which they can tolerate, so that message Wij is decoded when there are 
less than j interfering, regardless the identity of the interferers. 

To prove the achievability of Sfp using this message structure, we observe that ^ p is the convex 
hull of m extreme points, and that to achieve the kth extreme points it suffices that user i transmits a 
single information message, namely W^f., encoded at rate \. Thus, a simple single-layer coding strategy 
can achieve all extreme points of and the proof of the achievability is completed by means of a 
time-sharing argument. 

Having an exact characterization of ^ at hands, we can explicitly solve the throughput optimization 
problem. The main result of this section is given by the following theorem. 

Theorem 8: Let H m represent the partition of the unit interval into the set of m intervals 

Oo,Pl], (P1,P2], • • • , iPm-UPm], 

where po = 0, p m = 1 and, for < k < m, pk is defined as the unique solution in (0, 1) to the following 
polynomial equation in p 

-j^-jF m _ lyk (p) = - J P m _i )fc _i(p). (28) 

Then, the following facts hold 

1) Pi = h Pm-i = ml /(l-D , and p G (0, for k G {2, . . . , m - 2}. 

2) The throughput of the m-user symmetric BD-RAC is given by 

Trip 

T(p,m) = —F m _ lyk (p), if pe (pk-i,Pk\, (29) 

for k € {1, . . . , m}. 

3) T(p, m) is achieved when all active senders transmit a single message encoded at rate 

r (p) = t, if P € (Pk-i,Pk\, (30) 

for k G {1, . . . , m}. 

4) T(p, m) is a continuous function of p; it is concave and strictly increasing in each interval of the 
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partition U m . 

Proof: See Appendix IV. ■ 
Remarks: The above theorem says that T(p, m) can be achieved by a coding strategy which does 
not require simultaneous transmission of multiple messages. Instead, each active user transmits a single 
message encoded at rate r(p). Inspection of (l33l ) reveals that r(p) is a piecewise constant function of 
p, whose value depends on the transmission probability p. If p is in the kth interval of the partition 
n m , then r(p) is equal to \, Similarly, the corresponding achievable throughput T(p,m) is a piecewise 
polynomial function of p. The boundary values of the partition, denoted by the sequence {pk}, are given 
in semi-analytic form as solutions of (|28T ). and closed form expressions are available only for some special 
values of m and k. Nevertheless, Theorem [T3l provides the upper bound p& < —. 

The structure of the solution is amenable to the following intuitive interpretation. Based on the 
knowledge of m and p, transmitters estimate the number of active users. More precisely, if p is in 
the kth interval of the partition II m , i.e., pu-i < p < then transmitters estimate that there are k 
active users. Since p^ < — , it is interesting to observe that the computed estimator is in general different 
from the maximum-likelihood estimator \mp\. Then, they encode their data at rate r, that is, each user 
requests an equal fraction of the £>user binary MAC sum-rate capacity. Clearly, there is a chance that the 
actual number of active users exceed k, in which case a collision occurs. Vice-versa, the scheme results 
in an inefficient use of the channel when the number of active users is less than k. However, this strategy 
represents the right balance between the risk of packet collisions and inefficiency. 

It is interesting to note that when p < pk~i the optimal strategy consists of encoding at rate 1, i.e., at 
the maximum rate supported by the channel. As already remarked, this is the coding strategy used in the 
classic ALOHA protocol. Notice that since pi = —, this strategy is optimal when the probability of being 
active is less that the inverse of the population size in the network. In this case, there is no advantage in 
exploiting the multi-user capability at the receiver. On the other hand, for p > -j-, the throughput of an 
ALOHA system is limited by packet collisions, which become more and more frequent as p increases. 
In this regime, the encoding rate has to decrease in order to accommodate the presence, which become 
more and more likely as p increases, of other potential active users. 

B. Throughput scaling for increasing values of m 

If we let the population size m grow while keeping p constant, the law of large number implies that 
the number of active users concentrates around mp, so one would expect that the uncertainty about the 
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number of active users decreases as m increases. This intuition is confirmed by the following corollary, 
which states that the probability of collision tends to zero as m grows to infinity. 

Corollary 9: Let p £ (0, 1). Then, lim m _^ OQ T(p,m) = 1. 

So far, we have been assuming that p does not depend on m. Assume now that the total packet arrival 
rate in the system is A, and let p = ~ be the arrival probability at each transmitting node. Let T(A) 
denote the throughput in the limit m — ► oo. Then, by applying the law of rare events to d28l) and (|29l ) 
we obtain the following corollary to Theorem [8] 

Corollary 10: Let Ao = 0, Aqo = oo and, for < k < oo, let A& be defined as the unique solution in 
(0, oo) to the following polynomial equation in A 



where V(k + 1, A) is the incomplete gamma function. Then, as m tends to infinity, the throughput is 
given by 



for k G Z. The rate which attains the throughput is given by r(A) = r, if A € {Xk-i, A&], fceZ, Finally, 
T(A) is a continuous function of A; it is concave and strictly increasing in each interval (Afe_i, A&], and 
liniA^oo T(X) = 1. 

Note that the claim above is in striking contrast with the throughput scaling of the classic slotted 
ALOHA protocol. The throughput of slotted ALOHA increases for small A, it reaches a maximum e _1 
at A = 1/m, after which it decreases to zero as A tends to infinity. See Fig. [6] for a comparison between 
T(A) and the throughput of standard ALOHA as a function of A. 

VII. Example 2: the m-user symmetric AWGN-RAC 

We now turn to another important example of additive channels. Suppose that the codewords generated 
by the m encoders are composed by n random variables taking values over the reals, and whose 
realizations satisfy the following average power constraint 



for some positive constant P. Observe that we focus on the symmetric case in which all users are subject 
to the same received power constraint. Furthermore, suppose that {Za} in (fl"5l) are independent standard 



1 



r(jfe + l,A) =T{k,X) 



k + l 



T{\) = — r(fc + l,A), if A G (Afc_i, Afc], 



n 
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0.7 




10 



A 



Fig. 6. Comparison between T(A) and the throughput of the slotted ALOHA protocol 

Gaussian random variables, and that the sum in (U~5l) is over the field of real numbers. Applying Theorem 
|5J we obtain the following proposition. 

Proposition 11: The capacity region ^of the m-user symmetric AWGN-RAC is contained inside the 
set of {ri(A)} tuples satisfying 



for all K G {1, ... , to} and i\ ^ . . . 7^ i m G {1, . . . , to}. 

A. An approximate expression to within one bit for the throughput 

Next, we turn to the problem of characterizing the throughput T(p, to, P) for the symmetric AWGN- 
RAC as a function of the transmission probability p, the population size to, and the available power P. 
First, we provide inner and outer bounds on ^ p for this channel. 

Theorem 12: Let % p denote the set of rates {pk} £ K m such that 



n{B) < n(A) for all i G B C A, 



and 



A' 



5> fc ({*!... i fc })<C(iCP) 



fe=l 




> 
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and ^ 

for all K G {1, ... , m}. Let denote the set of rates {p k } G W 71 that satisfy (I25ab and 

m 

1 Pi i \ " ( __k fc-1 ^ Pfc < 1 

CIPJ J^J + 2_y \C{kT) C({k-l)P)J fc jm) ^ A - 

Then, tf, C % C ¥ p . 

The proof of the above theorem is omitted since it closely follows the proof of Theorem |7J As for the 
case of the BD-RAC, the achievable region in the above theorem is obtained by considering the message 
structure defined by d26l) and (|27T ) and the coding scheme we utilize does not require the use of Gaussian 
superposition coding. 

In virtue of Theorem [12] it is possible to bound T(p, m) as 

T(p, m, P) < T(p,m) < T(p,m,P), 

where lower and upper bounds are given by (1221 after replacing c tf p ^ m with c €_ p m and ^ p ^ m respectively. 
The following theorem provides an expression for T(p, m, P). 

Theorem 13: Let n m (P) represent the partition of the unit interval into the set of m intervals 

(po(P),Pi(P)],...,(p ro -i(P) ) p m (P)], 

where po(P) — 0, p m (P) = 1 and, for k G {1, . . . ,m — 1}, Pk(P) is defined as the unique solution in 
(0, ^) to the following polynomial equation in p 

C((fe + l)P) C(fcP) 

J *m-l,fc(p) = ^ *m-l,fc-l(p)- (31) 

Then, T(p, m, P) is a continuous function of p, concave, strictly increasing in each interval of the partition 
II m (P), and is given by 

T(p,ro,P) = ( ^p-mpF m _ hk _ l (p), if p G (p fc _i(P),p fc (P)], (32) 

for fc G {1, ... , m}. To achieve T(p, m, P), it suffices that each active user transmits a unique message 
encoded at rate 

r(p,m,P) =^j~t ifp€ (p fc -i(P),pjfe(P)], (33) 
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for ft € {1, ... , m}. 

The proof of the above theorem is omitted since it closely follows the proof of Theorem |7J Similarly 
to what stated by Theorem [7] for the BD-RAC, the above theorem says that T(p, m, P) can be achieved 
by a coding strategy which does not require superposition coding: each active user transmits a single 
message encoded at rate r(p,m,P). Both r(p, m, P) and T_(p, m,P) are piecewise constant function of 
p, whose value depends on the transmission probability p. 

The coding scheme used to achieve T_(p, m, P) for the symmetric AWGN-RAC is similar to the one 
used to achieve the throughput of the symmettic BD-RAC: based on the knowledge of m and P and p, 
transmitters estimate the number of active users. More precisely, if p is in the ftth interval of the partition 
II m (P), i.e., pfc_i(P) < p < Pk(P), then transmitters estimate that there are k active users. Then, they 
encode their data at rate ^C(ftP), that is, each user requests an equal fraction of the ft-user AWGN MAC 
sum-rate capacity. 

A natural question to ask is how close this scheme is to the optimal performance. To answer this 
question, we first need to provide an expression for T(p,m,P). This is done in the next Theorem. 
Theorem 14: Let Il m represent the partition of the unit interval into the set of m intervals 

(po,Pl], • • • , (jPm-l,Pm], 

where po = 0, p m = 1 and, for every ft & {1, ... , m}, pk is defined as the unique solution in (0, ^) to 
the following polynomial equation in p 

rr7^i,*(p) = jFm-i,k-i (p). (34) 
k + 1 k 

Then, T(p, m, P) is a continuous function of p, concave and strictly increasing in each interval of the 
partition n m (P), and is given by 

m 

T(p,m,P) =mp ^] Vk,iF m -i,i-i (p) if p € (pjfc-i,Pfc], (35) 



i=i 



for ft G {1, ... , m}, where 



wi, 



2C(2P)-C(P), i = l, 

2C(iP)-C((i + l)P)-2C((i-l)P), ie{2,...,m}, (36) 
C(mP) — C((m — 1)P), i = m, 
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For k G {2, . . . , m - 2} 



Vk,: 



0. 

^ic(fcP)-c((fc + i)p), 

2C(iP) - C((t + 1)P) - C((t - 1)P) 
C(mP)-C((m-l)P), 



For k 



m 



Vm—1A 



0. 



^ I C((m - 1)P) - C(mP) 
C(mP) -C((m- 1)P), 



i G {l,...,fc- 1}, 

i = k, 

i G + 1, . . . , m — 1}, 

i = m, 

i G {l,...,m-2}, 

i = m — 1, 
i = m, 



(37) 



(38) 



For 



m 



(39) 



0, ie{l,...,m-l}, 
^C(mP), i = m. 

Proof: See Appendix V. ■ 
The proof of the above theorem is conceptually simple but technical, as it requires finding the analytic 
solution of a linear program. Comparing the statements of Theorems [13] and [Q] one can observe that the 
basic structure of T(p, m,P) and T(p, m, P) is the same. As opposed to the sequence {pk(P)} defined 
in Theorem [TBI the sequence {p^} in Theorem [141 does not depend on the power P. It is easy to see 
that Pfc(P) < Pk < k/m, for every k. Furthermore, the sequence {p^} defined in Theorem [141 is equal 
to the sequence defined in Theorem [7] By directly comparing T(p,m,P) and T_(p, m,P) we obtain the 
following result. 

Theorem 15: Let p G (0, 1], m > 2 and P > 0. Then, 

T{p, m, P) — T(p, m, P) < 1. 

Proof: See Appendix VI. ■ 
The above theorem says that our suggested coding scheme achieves an expected sum-rate which is 
only 1 bit away from the optimum, independently of the values of p, P and m. It it remarkable that the 
gap does not increase with the population size of the system. Thus we conclude that transmitting at rate 
^C(fcP) when p is in the kth interval of the partition n m (P) represents the right balance between risk of 
collision and efficiency: encoding rates above ^C(kP) would increase the collision probability, yielding 
a decrease in the expected sum-rate. Viceversa, rates lower than ^C(fcP) would result in an inefficient 
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Fig. 7. Upper and lower bounds on the throughput of a four-user symmetric AWGN-RAC and encoding rate achieving the 
lower bound, as a function of the transmission probability p (P = 15 dB). 



use of the channel. 

Fig. |7] shows plots of T(p, m, P), T(p, m, P), and r(p, m, P) for the case of networks with four users. 
Observe that the T(p, m, P) is a piecewise concave function of the transmission probability. 

B. Comparison with other notions of capacity 

The expression for the throughput derived in the previous section can be compared to similar expres- 
sions obtained assuming other notions of capacity. A natural outer bound is given by the throughput 
achieved assuming that full CSI is available to the transmitters. In this case, the sum-rate of the k- 
user AWGN-MAC can be achieved whenever k users are active. Averaging over the message arrival 
probability, we obtain the following expression for the throughput: 

m 

T CS l(p, m,P)±Yl fm,k(p)C(kP). (40) 
fc=i 

On the other hand, if we study the symmetric AWGN-RAC following the adaptive capacity framework 
as in lH4l . then each transmitter designs a code which has to be decoded regardless the number of active 
users. This is a conservative viewpoint and forces each user to choose a rate of l/mC(mP) so that users 
can be decoded even when all m transmitters are active. Thus, we obtain 

T AD (p,m,F)±pC(mP). (41) 
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Fig. 8. Throughput of the symmetric AWGN-RAC with m — 25 users (P = 20dB). 



Fig. [8] compares the obtained bounds on T(p, m, P) for the case m = 25 and P = 20dB to the throughput 
under the adaptive -rate framework (|4TT >. and assuming full CSI available to the transmitters (l40l ). 

Finally, observe that in order to achieve T(p, m, P) transmitters have to estimate the number of active 
users by solving the polynomial equations ( f3TT >. A natural question to ask is what is the achievable 
throughput performance if a maximum-likelihood estimator for the number of active user is used instead. 
Consider the following strategy. Suppose that, based on the knowledge of m and p, and assuming no 
prior on the number of active users, transmitters compute Uml, the maximum-likelihood estimator for the 
number of active users, and encode their data at rate C(/cmlP)/^ml- Since the most probable outcome 
of (m — 1) Bernoulli trial j^] with success probability p is the integer number between mp — 1 and mp, 
we have that Uml = [mp\. Thus, we obtain the following expression for the expected sum-rate capacity: 

T ML {P, m, P) 4 ^-C(k M LP)F m -i,k ML -i(p). (42) 

Fig. [9] compares T_(p, m,P) and (l42l ) for the case m = 25 and P = 20dB. We remark is that the ML 
estimator for the number of active users result in a strictly suboptimal throughput performance. 

'Each active transmitter estimates the state of the remaining (m — 1) users. 
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P 

Fig. 9. Throughput performance of the proposed estimator vs ML estimator for the number of active users (P = 20dB, 
m = 25). 

VIII. Discussion and practical considerations 

In networking, much research effort has been put in the design of distributed algorithms where each 
agent has limited information about the global state of the network. The model we developed in this paper 
allowed us to focus on the rate allocation problem that occurs when multiple nodes attempt to access 
a common medium, and when the set of active users is not available to the transmitters. Our analysis 
has lead to a distributed algorithm which is easily implementable in practical systems, and which is 
optimal in some information-theoretic sense. The rule of thumb which we have developed is that, upon 
transmission, senders should estimate the number of active users according to a prescribed algorithm 
based on the knowledge of the population size and the transmission probability m, and then choose the 
encoding rate accordingly. 

In this paper we focused primarily on the problem of characterizing the throughput assuming perfect 
symmetry in the network, that is, the same transmission probability and received power constraint across 
users. The reasons for enforcing symmetry are twofold. First, throughput maximization is a meaningful 
performance metric only in symmetric scenarios. Second, it allows us to focus on random packet arrivals 
at the transmitters, and not on the different power levels at which transmitted signals are received by the 
common receiver. This set-up is a realistic model for uplink communications in power-controlled cellular 
wireless systems. Nevertheless, an interesting open question is how to apply the layering approach to the 
m-user AWGN-RAC with unequal power levels at the receiver, assuming that each sender only knows 
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its own power level and state. 

We made the underlying assumption that users can be synchronized, both at block and symbol level. 
In light of this assumption, a time-sharing protocol could be employed to prove achievability results. 
A simple way to achieve this partial form of cooperation among senders is to establish, prior to any 
transmission, that different coding schemes are used in different fractions of the transmission time. 
However, in practice achieving such complete synchronization may not be feasible. An interesting open 
question is to characterize the performance loss due to lack of synchronism. In this case, the resulting 
capacity region need not be convex, as for the collision model without feedback studied by Massey and 
Mathys lfT6l . 

We also assumed that the receiver has perfect CSI, that is, it knows the set of active users. The question, 
relevant in practice, of how the receiver can acquire such information is not discussed here, and we refer 
the reader to the recent studies of Fletcher et al. ifTTTl . Angelosante et al. [4:], and Biglieri and Lops Q, 
which address the issue using sparse signal representation techniques and random set theory. 

Finally, in this paper the transmission probability p and the number of users m play a pivotal role 
in setting the encoding rate, and these quantities are supposed to be known at the transmitters. The 
probability p is determined by the burstiness of the sources, while m has to be communicated from the 
receiver to the transmitters. In practice, our model applies to communication scenarios in which the base 
station grants access to the uplink channel to m users, but where only a subset of these users actually 
transmit data. 
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Appendix I: Proof of TheoremH] 

Observe that ^ 2 * s a polytope in defined as the intersection of eight hyperplanes, two of which 
representing non-negativity constraints. By the Weyl-Minkowski theorem, c €_ 2 is the convex hull of finitely 
many rate vectors. It is tedious but simple to verify that 

^ 2 = conv{vi, . . . ,vh} 
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conv < 



U 




P(T> \ 

c ( p i) 




U 




C( p i) 




c ( p i) 




U 




C (P 2 +J 




c ( p i) 












C(P 2 ) 




C(Pa) 









C(Pa) 




C(P 2 ) 














5 










; 


C(Pi) 












J 






























C(P 2 ) 




C(P 2 ) 











" C(Pi) " 




C(p&t)" 




" c(po " 




" C(Pi) " 




" C(Pi) " 












C(P 2 ) 




C(P 2 ) 




C(P 2 ) 




C(P 2 ) 


> 


C(Pi) 




C(Pi) 




C(p^t) 




C(f&i) 


















C(pSt). 




. W . 


















(43) 



By convexity, it suffices to show that for every i G {1, . . . , 14}, there exists an achievable rate vector 
r-j such that d(vj, Tj) < 1. It is straightforward to verify that, for every £g{1,...,11}, VjG U ^2- 
Thus, d(vj, rj) = for all z G {1, . . . , 11}. 

Consider the rate vector ri2 G obtained by setting equality sign in the inequalities (fT4b with 
ft = g and ft = 1, i.e., r 12 = [C(P 2 ) + C (f^ ) ,C(P 2 ),C (f^) ,0] T . We have that 



d(vi2,ri 2 ) < 



C(Pi)-C(P 2 )-C 



2P 2 + 1 



+ 



c(p^)-c 



P1-P2 
2P 2 + 1 



1 , I PiP 2 - P9 
log 1 + 1 

PiP 2 - P| 



2 


1 . 2P 2 + 1 


+ 


- log 


2 S P 2 + l 



< 



1 2PiP 2 

— log ^ 

2 B P!P 2 -P2 



+ 



1 . 2P 2 + 1 

— log — 

2 & P 2 + 1 



< 



1 



V2' 



(44) 



Next, consider the rate vector ri3 = [C(Pi), C(P 2 ), 0, 0] T G %[ 2 ", obtained by setting equality sign in the 
inequalities (fT4l with 0i = 1 and /3 2 = 1. We have that 

1 



Pi+i/ 



< 



V2" 



d(v 13 ,ri3 

Finally, the distance between v 14 and r 12 can be bounded as follows 

d(vi4,ri2) < 



(45) 



C(Pi) - C(P 2 ) - c 



Pi -Pi 



2P 2 + 1 



+ 



c( 



Pi ' 
P2+1. 



c 



2P 2 + 1 



+ 



< 



(46) 



Combining (0U), (@3]>, and (J46J) we conclude that d(vj,r-j) < i G {12, 13, 14}, which concludes the 



December 16, 2009 



DRAFT 



proof. 



36 



Appendix II: Proof of Theorem[5] 

Inequalities (TT8T ) follow immediately from assumption Al.. Next, fix i\ / 12 7^ 
By Fano's inequality, we have that, for all r 6 {1, ... , m}, 



^ i m G {!,... 



if U W ik ({i 1 ...i r })\Y il ... ir <ne n , 



k=l 



where e n — > in the limit of n going to infinity. In particular, d47T ) implies that 



(47) 



H(W ir ({ii...i r })\Yi^ Ar ) <ne n . 



(48) 



Let K G {1, . . . , m}. Then, the following chain of equalities holds: 



A 



n 



J2r ik ({ii...i k }) 



k=l 



A 



H(uW ik ({h...i k }) 



A 



H(uW ik ({h-..ik}) 



A 



j U i {W ifc \W ifc ({ti...ifc})} 



A' 



/(uW it ({ ll ...^});Y 1 ,, K 



A 



u {vw ifc \w ifc ({ii...i fc })} 



fc=i 



A 



+ ^( j u i w ifc ({ii...i fc }) 



A" 



Y i ,. lK ,U i K,\W lt ({i 1 ...i t })} 



(49) 



The first term in the right hand side of (1491 ) can be upper bounded as follows 



[uW lk ({i 1 ...i k });Y ll ... lK 



5{w ijb \w ijb ({i 1 ...i fc })} 



H(Yi 



< H (Yi 



H Y 



H Y 



K 

u W ik 
k=i 

pw lk ,x tl ,...,x tK 

k=l 



I (-^M> • • • ' Xiictf Yii...i K ,t) 



t=l 



(50) 



where we use the fact conditioning reduces the entropy and the memoryless property of the channel. On 
the other hand, application of the chain rule on the second term at the left hand side of (1491 yields 



[U i W ih ({i 1 ...i k }) 



A 



Yi 1 ... ijt) ^ 1 {W ik \Wi 4 ({ii...u})} 
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J2 H (w ir {{h ...i r }) Y h ... iK , uJWi, \ W lk ({k . . . i fc }) } , fe= U +i W lk ({h . . . i k }) ) 

r=l ^ ' 

Y / H(w ir ({i 1 ...i r }) Jj {W ik \W ik ({i 1 ...i k })} *W ih ) 

* — ' \ k=l k=r+l J 

r=l v ' 

J2H(w ir ({i 1 ...i r }) Y il ... jK , fc UjW,a^({ii...i^)}, fc J +i W lfc , fc JJ +i X Jfc ') 

r=l ^ ' 



(51) 



K 



r=l 
K 



Y J H[W iT {{h---ir}) 



U {WuVWUfa.-.i*})} 



fc=i 



£#(W ip ({ii...i P })|Y il ... i J 



r=l 



(52) 
(53) 

where (|5TT) uses the fact that Xj fc is a function of W^, (1521) uses the fact conditioning reduces the entropy, 
and (O follows from (|48T ). 

Therefore, substituting (l50l) and (l53l) into d49l ), we obtain that 

K n 

n /, r ik{{h ■ ••«&}) < yjl^i^t, • • • ,X iKjt ;Y il „,i Kjt ) + nKe n , (54) 
fc=i t=i 

and the claim is completed by introducing a standard timesharing random variable and letting the block 
size n tend to infinity. 

Appendix III: Proof of Theorem[7] 

Let V denote the convex subset of M m described described by inequalities (I25ab and (125bl) . First we 
prove the converse part, by establishing that ^ C V. As a first step, we derive a useful identity. Let 
k E {1, . . . , m}. Then, 

r ik ({i 1 ...i k }) = (m- k)\ ^ r ik {{i x . . . i k }) 

iif ■■^im£{l,-,ni} ii^— 7^ifce{l,...,m} 



(m-k)\{k-l)\ ^ ^2n(A) 

AC 

(m - fc)!(fc - l)!p fc , 



AC{l,...,m} ieA 
|A|=fc 



(55) 



where the second equality uses the fact that ri k [i\ . . . i k ) = ri k ({i ai , , i k }) for any permutation 
<t over the set {1, . . . , k — 1}. Now we can establish the necessity of (I25bb . It follows from d24l) that the 
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following inequality has to hold 

in 

^2n k (h...i k ) < 1, (56) 

k=l 

for all i\ ■ ■ ■ 7^ i m G {1, . . . , to}. By summing both sides of d56l ) over all permutations over the first 
m integers, we obtain 

m 

^r ifc ({n...i fc })<m!. (57) 

ii^---^i m e{l,...,m} fc=l 

By means of (IB3T ). (fSTb can be re-written as 

m 

^(m - k)\(k - l)\p k < ml (58) 

fc=i 

Dividing both sides of (I58T ) by to,!, we conclude that (I25bb is a necessary condition for the achievability 
of a rate vector p. 

Next, note from (|23T > that ri(A) > rj(I?) for all i € A C B C {1, . . . , m} is a necessary condition to 
the achievability of a rate vector {ri(A)}. By summing these inequalities over all B having cardinality 
I A I + 1, we obtain that 

n{A)>— Yl ^ 

m — \A\ 

1 1 B:ieACBC{l,...,m} 
\B\=\A\+1 

Next, observe that, for every k E {!,... , to — 1}, 



p*^ E E r ^) 

A:AC{l,...,m} i&A 
\A\=k 

m 

= E E *w 

i=l A:AC{l,...,m} 
j G A, |A|=fe 



^ E ^ E 

i=l A:AC{l,...,m} B:iGACBC{l,...,m} 
iGA, |A|=fe |B|=fc+l 

= ^E E E n(B) (61) 

i=l B:BC{l,...,m} A:ACB 
ieB,|B|=fc+l *eA, |A|=fc 
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m — k 

i=l B:BC{l,...,m} 
i£B,\B\=k+l 

-Pk+i (62) 



m — k 

where (l60l) follows from d59l ), while (l6Tb is obtained observing that there are k subsets of B which have 

(m-l)\ 

\k-iy. 



cardinality k and contain the element i. After multiplying right and left hand side of (l62l) by ^ m ^ 



and rearranging the terms, we obtained the desired inequality 



Pk > Pk+l 



which proves (|25a| ). In summary, we showed that inequalities (|25ab and d25bb are necessary conditions 
for the achievability of a rate vector p, i.e., ^ C V. 

Next, we prove the achievability of V, establishing the reversed inclusion V C To do so, it suffices 
to show that the extreme points of V are achievable, as the rest of the region can be achieved my means 
of a time-sharing protocol. We claim that 

k 



V = conv< 



1 i — ^ i m "i 



i=l 

where the vector ej denotes the ith unit vector in W 71 . To see this, consider an invertible linear transfor- 
mation L : W n ->• M m given by 

x — — ^ — 



P Pk+ (64) 

x k /m\ ~ " 77 m \ ; ^ ^ {1, . . . , 771 — 1}. 

HfcJ ( fc+1 )U+iJ 

It is straightforward to check that the image V under L is given by the oriented m-simplex LV = 
{x € M.™ : ^2k"=i^ x k < 1} = conv {0, v' 1; . . . , v' m } , wherein \' k = ie^. Since L is invertible, 
the extreme points of V can be obtained by applying L _1 to the extreme points of LV. Thus, V = 
conv {0, pi , . . . , p m } where 



1 k 



fc G {1, . . . , m}. Hence (163T ) is proved. 

Next, we show that each rate vector p^ given by (l65l) is achievable. Consider the following message 
structure: 

Wi = {W i> i,...,W ijm } (66) 
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and 

u j>\A\Wi,j, i G A 



Wi(A) 



(67) 



It is immediate to verify that the above sets satisfy conditions Al., so the message structure is well 
defined. For every i, sender i transmits m independent messages {Wj,i, ■ ■ ■ , Wi, m } encoded at rates 
{Ri,i, • • • ,Ri >m }. For every k £ {1, . . . ,m}, the kth message Wik is decoded at receiver A if i G A 
and if \A\ < k, that is, if user i is active and there are less than k active users. To achieve the rate 
vector pk it suffices to set Ri^ = \ for all i, and the other rates equal to zero, that is, each sender % 
transmits a single message of information W^k encoded at rate i. Encoding is performed by means of 
a standard multiple-access random codebook. It follows from (l67l) that receiver A decodes Wi t k if i E A 
and \A\ < k. Thus, we have 

{I, i G A and \A\ < k; 
fc ' '- (68) 

0, otherwise. 

Observe that for every receiver A the sum of the rates of the decoded messages is at most 1. It follows 
that decoding can be performed by means of a standard fcuser multiple-access decoder. By plugging (l68l) 
into (1211 ). we obtain that 

' if i€ ,*} 

Pk,i = \ (69) 
0, otherwise, 



hence d65l ) is achievable. 



Appendix IV: Proof of Theorem [8] 



In order to prove the theorem, we first need to state two lemmas. The first lemma builds upon properties 

of the cumulative distribution function of the Binomial distribution. 

Us 



Lemma 16: Let k € {1, . . . , m — 1}. There exists a £ (0, ^) such that 



>o, 


p <pk 


= 0, 


p = pk 


< o, 


p> Pk 



iF m _ lifc _i(p)-— —F m _ ltk (p) < 



Proof: Define f(p) = ^F m _ ljfc _i(p) - ^F m _ ljk (p). The binomial sum F m _ 1)k _ x (p) is related to 
the incomplete Beta function by (TJ (6.6.4) page 263] 

fm-Lfc-iGO = i - r^-^i - ty^-Xdt. (7i) 

Jo 
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>o, 


p <pk 


f(p) < 


= 0, 


p = pk 




<0, 


p> Pk 



Substituting (T7TT > into the definition of f(p) and differentiating, we obtain the following expression for 
the derivative of / with respect to p f'{p) = -^jZp)f m -i,k{p) [ l ~PWi] ■ B y studying the sign of f'(p) 
one can see that f(p) is a strictly decreasing function of p in the range (0, ^-), reaches a minimum 
at p = and is a strictly increasing in the interval (^±i,l). We have /(l) = 0, and the Taylor 
expansion centered at p = 1 shows that f(p) increases to zero as p tends to one. Thus, / (^-) < 0. 
Note that /(0) > so, by the monotonicity of / and by the mean value theorem, there exists a unique 
Pk e (0, *±1) such that 



(72) 



To complete the proof, we show that Pfc < ^. Direct computation shows that p\ = 1/m, while for 
k G {2, . . . , m — 1}, we have that 

i ~~ fcTl) ^ ?m - 1 . fc - 1 (m) — feTl/ra-l>fc(m) 
< (fc — FPl) kfm-l,k-l{m) ~ h+lfm-l,k-l(m) 

= 0, (73) 

where the inequality follows from the fact that f m -i,i{— ) < /m-i,fe-i(~) for i G {0, . . . ,k — 1}, with 
equality iff i = fc-l, and that / ro _i, fc _i(^) = / m -i,fc(£) for A: G {2, . . . ,m - 1}. Thus, O and d2l 
show that pk < ^ as claimed. ■ 
Roughly speaking, the above says that to achieve the throughput the encoding rate has to decrease as the 
transmission probability increases. The second lemma shows that 1/k is the optimal encoding rate when 
p is in the /cth interval of the partition Il m (P). 

Lemma 17: Let k G {1, . . . , m}. Define po — and p m = 1 and let {pk}™Zi be as in Lemma [T6l 
Then, 

--F m _i jfc _i(p) > -F m _ij-_i(p), j G {1, . . . ,m}, (74) 

for p G [pfc-i,Pfc]. 

Proof: In virtue of Lemma [161 it suffices to show that pk < Pk+i, for k G {0, . . . ,m — 1}. As 
Pi G (0, 1/m], it follows that po < Pi- Next, suppose that k G {1, . . . , m — 1}. Lemma [T61 shows that 
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lF m -i,k-i(pk) = kTT F m-i,k{Pk) and that Pk e (0; -|). Thus, we have 

k^lFm-l,k(Pk) — T^Fm-ljk+lipk) 
= \F m -l t k-l{Pk) - ]f^2-fm-l,fc+l(Pfc) 

I + ^m-l,fc-l(Pfc) - ^(-F m -l,fe-l(pjfe) + F m _i jfc+ l(p fc )) 

> fc(fc+2) -^-l.fc-l(^fc) ~ TP& F m-\,k(pk) 

2(fe+l) |7i / \ 2(fc+l) j-, / \ 

k(k+2)^rn-l,k-l{Pk) ~ k lk+2)*m-l,k-l\Pk) 

= 0, (75) 

where the inequality uses the fact that F m -i,fc-i(pjfc) + F m ^ k+ i(p k )) < 2F m ^ k (p k ) for p < k/m. 
Comparing (PTOl ) and (I75T ). we obtain the desired inequality p k < p k +i- ■ 
Using the above lemma, it is immediate to prove theorem [8] 

Proof: Observe that the optimum value of a linear program, if it exists, is always achieved at one 
of the extreme point of the feasibility set. Thus, (l63l) implies that 

k / \ 

I IT) \ 

pY 



Tip, m, P) = max -Y^'l 

fce{i,...,m} fc ^ \i J 



1 

max mp-F m _i 
fce{l,...,m} k 



fc=i 

where the last equality follows from Lemma [TT] 



Appendix V: Proof of Theorem [TJ] 
Let Cfc = C(kP). In order to evaluate T(p, m, P), it is convenient to make the change of variable 



x m — 7mYi 

* (76) 
^A; = , /m\ ~ T. — 777 Vn. \ ; fe € {1, . . . , m — 1}. 

fe UJ ( fc+1 )U+iJ 

Substituting the new variables into (l22l ). d25a|) , and d25b| ) and performing a modicum of algebra, we 
obtain, 

m 

T(p,m,P) = max mpV Fm-l,i-l(p) ( 77 ) 
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where ^ Xjm denote the set of rates {xk} G K+ such that 

K-i m 

kx k + K^x k <c K (78) 

k=l k=K 

for every K G {1, . . . ,m}. Observe that the optimum value of the linear program (TTTb is achieved at 
one of the extreme point of the feasibility set. Therefore, to prove the theorem it suffices to show that 



{ v fc}feli as defined in 



are extreme points of ^x.m, and that the objective function in (1771 ) reaches 



a strict local maximum at when p is in the kth interval of the partition n m (P). 

For every k G {1, . . . ,m}, it is straightforward to check that v& satisfies (|78T ) for K G {k, . . . ,m}, 
and that has k — 1 zero components. Thus, we conclude that is an extreme point of ff^m. 

Next, we establish that if p G \pk~iiPk\ where {pk-i} are defined in Lemma [T6l then the objective 
function reaches a local maximum at v^. We proceed by showing that the objective function at 
is strictly greater than at any of its neighboring extreme points. By definition, two extreme points are 
neighbors if they are connected by an edge. It is possible to show that has exactly m neighbor extreme 



points, which we denote by < n 



.(*)V 



is not reported here. For k E {1, . . . ,m — 1}, we have that 
. If j G {!,..., k- 1}, then 



. The proof of this fact is straightforward albeit fairly lengthy, so 



(fc) 



k 1_ 

0, 

l 



k-j+l 
j(k-j) C k 

Vk,i, 



k~j C 3 



* = J, 

i €{!,... 

i = k, 

i G {k + 1 



,j-l}U{j + l, 
, . . . , m} 



k-1}, 



(79) 



(k) 

If j = k, then nj = v k+1 . 
If j G {k + 1, . . . , m - 2}, then 



n 



(fe) 



1 

0, 



Cj-2 - 2 C J'+1' 



1. 



Co 



i+2 



2 C i-!' 



Z = J 
« = j, 

i = 3 + 1 
i G{1,.. 



(80) 



,j-2}U{j + 2,...,m} 



December 16, 2009 



DRAFT 



44 



If j = m — 1, then 



n 



m—lA 



3 1 

2 C m — 3 2*" m ' i = 7TI 2, 

0, i = m — 1, 

2^m 2^ m— 2' ^ — ^) 

Ufc.i, i G {1, • • • ,m - 3} 



(81) 



Finally, if j = m then 



c m -i-c m _2, i = m-l, 
0, i = m, 

Vk,i, i € {1, . . . ,m- 2}, 

On the other hand, for = m and j G {1, . . . , m — 1}, we have that 



(82) 



n 



(m) 



m .p. l_ r 



j(m-j) 1 m-j 



1 



(83) 



m-j 



m-j °J ' 



c,-, i = m, 



It can be immediately verified that |nj ' j as defined above are extreme points of ^x,m> and neighbors 
of v ft . 

Next, we establish that the objective function in (1771 ) reaches a local maximum at by comparing 
the value achieved at to the one at its neighboring extreme points. First, suppose k G {1, . . . ,m — 1}. 
• If j G {I, . . . , k — 1}, we can observe, from plugging ( |79T > into (1771 ) and performing some algebraic 
manipulations, that 

m 



1=1 

= Fm-i.fc-iCp) 



1 



J 



k-j k(k-j) 



-c k )-F m ^ 1 {p)\jj—- ) c 3 k _ j 



k 



1 



> F m _i fc _i(p) 



0. 



1 



k — j k(k — j) J k 



°k ) - r-Fm-l.fc-lCp) 



A; 



1 



- j) k-j 



because Fj-i < r-Fm-i,fc-l(p) if P i s i n the kth interval of the partition n m (P). 
If j = k, then 

m 

mp^2{v k:i - njJ]).F TO _i j i_i(p) 
i=l 
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1 fk + 1 \ (\ , . 1 

Cfc - Cfc_i T F m-l,k-l{P) ~ l—T F m-l,k{P) 



Cj-i + Cj+i \ ( ^ Fm-l,fc-l(p) + ^m-l,fc+l(p) 



fc + iV & * \k k + i 

>o, 

• If j € {fc + 1, . . . , m - 1}, then 

i=i 

= 2(q- ^;^ )|F^(p) 

>o, 

• If j = m, then 

m 

mp^2(v k4 - n^).F m _ M _i(p) = (c m - c m _i) (F m _i, m _i(p) - F TO _i )m _ 2 (p)) 
i=i 

>0, 

Next, suppose k = m. Compare the utility function at v m and nn . 



m 


-n (fc) )F 

u m,t) r rn- 


i=l 




- 1 m— l,m- 


■1(P) (— 


\m — 


F m— l,m— 


.w( 1 


Vm-, 


0. 





-Cm - F m _ij_i(p) — -Cj rc,; 



m—j m{m—j) J \j{ m ~J) m—j 

-C m rm-l,m-\(P) —, :Cfc 



m — j m{m — j) J m \j{ m ~J) m — j 



Therefore, we have established that the objective function reaches a local maximum at and completed 
the proof. 



Appendix VI: Proof of Theorem [T"5l 
Let ^ = C(kP). For every k € {1, . . . , m}, if p £ iPk-i,Pk] we h ave that 



T(p,m,P) > mp^F^k^ip). (84) 
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In particular, equality holds in (l84l) when p G [max(pfc_i ,pk-i(P)), min(pfc, pfc(P))]. It follows that 

m 

T(p,m,P) -T(p,m,P) < mp^^i-Fm-i,,-!^) - mp- F m _i )fc _i(p). (85) 



i=i 



for p G (,Pk-iiPk\- To prove the theorem, we show that the right hand side of (|85T ) is upper bounded 
by one for every k € {1, . . . ,m}. First, we consider the case k = 1. By substituting (l36l) into (|85T ). we 
obtain that 



mp 



rap 



mp 
3 



^vi,i-P m -i,i-i(p) - C(P)F m _i )0 (p) 

m 

3(c 2 - ci)F m _i j0 (p) + y^(cj+i - Cj)f m -x,j 

3=1 

3 m 1 1 

^-^m-l,o(p) + ^ yfm-l, : 



3=1 



m—1 



fm,l(p) + J -^—fm,j+l 
3=1 ZJ 



J=2 



-^fm,l{p) + (1 - /m,o(p) - fm,l(p)) 



< 1 



where the second equality uses the fact that Cj+i — Cj < l/(2j), while the last equality follows from 
2/ m ,o(p) > /m,i(p) f° r P £ (0, 1/m]. Similarly, from (l37T > we obtain that, for every k 6 {2, . . . ,m — 1}, 



mp 



y^fc,i-Fm-l,i-l(p) - — 

m—1 

(Cfe - Cfc_i)F m _ ljfc _i(p) + (2Cj - Cj_l - Ci + l)F m _l,i_l(p) 



i=fe+l 



mp 



+( l,m-l(p)] 

m—1 \ 
C k - C fc _i + ^ ( 2c i ~ C i~l ~ °i+l) + C m ~ Cm-1 F m -l,k-l(p) 



i=k+l 



m m—1 

+ ^2 y~) (2Cj - Cj-i - Cj+i + Cm- Cm-i) f m -ij-i(p) 
j=k+l i=j 
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m 



mp ^2 ( 



C j ~ c j-l)frn-l,j-l{p) 



j=k+l 



in 



1 



< mp y 

j=k+l 



2(i - 1) 



fm-\,j-\(p) 



m—1 



j=k 3 



< 1 



The proof is concluded observing that we have T(p, m, P 



) = T_{p, m, P) when k = m. 
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